
#import "template.typ" : *

#show: doc => simple-slide(doc, 
  subtitle: "",
  author: "Yue Tao"
)

// Title slide
// = Characterization & Identification of Cluster Workloads in LLM Applications: A Survey
= Workload Characterization in LLM Applications: A Survey

= Experimental Agentic RAG System
#figure(
  image("experimental/structure.svg", width: 45%),
  caption: [Experimental Agentic RAG System]
)

= Experimental Agentic RAG System

*System Architecture*: A multi-agent RAG system built from three groups of components:

*AGENTS*:
- Task Router / Info Collector: Central orchestration
- Document Agent: Handles document processing and retrieval
- Monitoring Agent: System performance tracking
- Web Search Agent: External information gathering

*SERVICES*:
- Models: Qwen3-Instruct-8B (generation, 8B parameters) and Qwen3-Embedding-0.6B (embeddings, 0.6B parameters)
- Databases: PostgreSQL + PGVector (vector storage), Redis (caching)

*HARDWARE*:
- Two RTX 4090 GPUs (24 GB each, 48 GB in total) for LLM inference
- CPU, memory, disk, and network resources for the database services

= 😊

#align(center)[
  #v(2em)
  #text(size: 2.5em, weight: "bold")[Thank You!]
  
  #v(1em)
  #text(size: 1.5em)[Questions & Discussion]
  
]